Abstract. We prove that a random automaton with $n$ states and any
fixed non-singleton alphabet is synchronizing with high probability.
Moreover, we also prove that the convergence speed is exactly
$1 - \Theta(1/n)$, as conjectured by Cameron [2] for the most interesting
2-letter alphabet case.

1 Synchronizing automata

Suppose $\mathcal{A}$ is a complete deterministic finite automaton whose input alphabet
is $\Sigma$ and whose state set is $Q$. The automaton $\mathcal{A}$ is called synchronizing if there
exists a word $w \in \Sigma^*$ whose action resets $\mathcal{A}$, that is, $w$ leaves the automaton in
one particular state no matter at which state in $Q$ it is applied: $q.w = q'.w$ for all
$q, q' \in Q$. Any such word $w$ is called reset (or synchronizing) for the automaton.
The minimum length of reset words for a given automaton $\mathcal{A}$ is called the reset
length of $\mathcal{A}$ and is denoted by $C(\mathcal{A})$.

Synchronizing automata serve as transparent and natural models of error-resistant
systems in many applications (coding theory, robotics, testing of reactive
systems) and also reveal interesting connections with symbolic dynamics and
other parts of mathematics. For a brief introduction to the theory of
synchronizing automata we refer the reader to the recent survey [10]. Here we discuss
the probability for automata to be synchronizable.

We take an example of such a model from [1]. Imagine that you are in a dungeon
consisting of a number of interconnected caves, all of which appear identical.
Each cave has a common number of one-way doors of different colors through
which you may leave; these lead to passages to other caves. There is one more
door in each cave; in one cave the extra door leads to freedom, in all the others
to instant death. You have a map of the dungeon with the escape door identified,
but you do not know in which cave you are. If you are lucky, there is a sequence
of doors through which you may pass which takes you to the escape cave from
any starting point.

The main result of this paper is quite positive: we prove that if a dungeon
has been chosen uniformly at random for a fixed number of colors $t \ge 2$,
then there is a life-saving sequence with high probability, namely
with probability $1 - O(1/n)$, where $n$ is the number of caves. Moreover, we prove
that the convergence speed is tight for $t = 2$.
This question has been extensively studied, and the best result in this
direction has been obtained in [12]. They proved that the probability that a 4-letter
random automaton is synchronizing is positive, and that if the number of letters grows
together with $n$ as $72 \ln(n)$ then a random automaton is synchronizing with high
probability.

Let $Q$ stand for $\{1, 2, \ldots, n\}$ and $\Sigma_n$ stand for the probability space of all
(unambiguous) functions from $Q$ to $Q$ with the uniform probability distribution. Let
$\mathcal{A} = (Q, \{a, b\})$ be a random automaton whose state set $Q$ equals $\{1, 2, \ldots, n\}$,
that is, $a$ and $b$ are chosen independently from $\Sigma_n$.
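The random model above can be illustrated with a short Python sketch. It is only an illustration, not part of the proof: the function names `random_automaton`, `is_synchronizing` and `estimate_sync_probability` are hypothetical, states are numbered from 0 rather than from 1, and synchronizability is tested with the standard pair criterion (an automaton is synchronizing if and only if every pair of states can be merged by some word).

```python
import random
from collections import deque
from itertools import combinations


def random_automaton(n, letters=2, rng=random):
    """Sample each letter independently and uniformly from all maps Q -> Q.

    maps[i][q] is the image of state q under letter number i.
    """
    return [[rng.randrange(n) for _ in range(n)] for _ in range(letters)]


def is_synchronizing(maps):
    """Pair criterion: every pair of states must be mergeable by some word."""
    n = len(maps[0])
    # Reverse edges of the pair graph: image pair (or singleton) -> source pairs.
    pred = {}
    for p, q in combinations(range(n), 2):
        for m in maps:
            x, y = m[p], m[q]
            pred.setdefault((min(x, y), max(x, y)), []).append((p, q))
    # Backward breadth-first search from the singletons (q, q).
    mergeable = set()
    queue = deque((q, q) for q in range(n))
    while queue:
        cur = queue.popleft()
        for pair in pred.get(cur, ()):
            if pair not in mergeable:
                mergeable.add(pair)
                queue.append(pair)
    return len(mergeable) == n * (n - 1) // 2


def estimate_sync_probability(n, trials, seed=0):
    """Empirical fraction of synchronizing automata among `trials` samples."""
    rng = random.Random(seed)
    hits = sum(is_synchronizing(random_automaton(n, rng=rng)) for _ in range(trials))
    return hits / trials
```

For example, `estimate_sync_probability(50, 200)` gives a rough empirical frequency for 50 states; such an experiment only suggests the trend and says nothing about the convergence rate proved in the paper.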

The underlying graph of a given automaton $\mathcal{A} = (Q, \Sigma, \delta)$ is the digraph
whose vertex set is $Q$ and whose edge multiset is $\{(q, \delta(q, a)) \mid q \in Q, a \in \Sigma\}$.
In other words, the underlying graph of an automaton is obtained by erasing
all labels from the arrows of the automaton. An example of an automaton and its
underlying graph is presented in Figure 1. The underlying graph of a letter
$a \in \Sigma$ is the underlying graph of the automaton $\mathcal{A}_a = (Q, \{a\}, \delta)$. It is clear that
each directed graph with constant out-degree 1 corresponds to a unique map
from $\Sigma_n$, and thus we can also regard $\Sigma_n$ as the probability space with the uniform
distribution on all directed graphs with constant out-degree 1.

We also need the definition of stable pairs, which was crucial in the solution
of the so-called Road Coloring Problem [9]. A pair of (different) states $\{p, q\}$ is
called stable if for any word $u$ there is a word $v$ such that $p.uv = q.uv$. The
stability relation $\rho$ is stable under the actions of all letters, and this relation is
full whenever $\mathcal{A}$ is synchronizing. The opposite definition is as follows. A pair of
states $\{p, q\}$ is called deadlock if there is no word $s$ such that $p.s = q.s$. A subset
$A \subseteq Q$ is called an F-clique of $\mathcal{A}$ if it is a set of maximum size such that each
pair of states from $A$ is deadlock. It follows from the definition that all F-cliques
have the same size.
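The two definitions can be made concrete with a small Python sketch. It is an illustration under stated assumptions, not the procedure used in the paper: the function name `deadlock_and_stable_pairs` is hypothetical, states are numbered from 0, and the code relies on the observation that a pair is stable exactly when no deadlock pair can be reached from it by applying a word.

```python
from collections import deque
from itertools import combinations


def deadlock_and_stable_pairs(maps):
    """Return (deadlock, stable) as sets of pairs (p, q) with p < q.

    maps[i][q] is the image of state q under letter number i.
    """
    n = len(maps[0])
    pairs = list(combinations(range(n), 2))

    # Reverse edges of the pair graph: image pair (or singleton) -> source pairs.
    pred = {}
    for p, q in pairs:
        for m in maps:
            x, y = m[p], m[q]
            pred.setdefault((min(x, y), max(x, y)), []).append((p, q))

    def backward_closure(start):
        """All pairs from which some element of `start` is reachable."""
        seen = set()
        queue = deque(start)
        while queue:
            cur = queue.popleft()
            for pair in pred.get(cur, ()):
                if pair not in seen:
                    seen.add(pair)
                    queue.append(pair)
        return seen

    # Mergeable pairs reach a singleton; deadlock pairs are all the others.
    mergeable = backward_closure((q, q) for q in range(n))
    deadlock = set(pairs) - mergeable
    # A pair is unstable if it is deadlock itself or can be mapped to a deadlock pair.
    unstable = deadlock | backward_closure(deadlock)
    stable = set(pairs) - unstable
    return deadlock, stable
```

For a synchronizing automaton the first set is empty and every pair is stable, which matches the remark that the stability relation is full in that case.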





Fig. 1. The automaton $\mathcal{A}$ and its underlying graph



Let $\mathcal{A}$ be a 2-letter $n$-state automaton. The scheme of the proof is as follows.

Suppose $\mathcal{A}$ is not synchronizing; then there is an F-clique $F_0$. The number of
states accessible from $F_0$ is at least $n/(4e^2)$ with probability $1 - O(1/n)$. Indeed,
the probability that there is a closed subset in the automaton of size less than $n/(4e^2)$
is bounded by

$$\sum_{i=1}^{n/(4e^2)} \binom{n}{i} \left(\frac{i}{n}\right)^{2i}.$$

For $i < n/(4e^2)$ each term of this sum is at least twice the next one. Hence the sum
can be bounded by the doubled first term, $2 \cdot \frac{1}{n}$. Take an arbitrary state $p \in F_0$.
Since the set of states reachable from $p$ is closed, the probability
that $p$ reaches fewer than $n/(4e^2)$ states is at most $O(1/n)$.

In Section 2 we prove that for any $c > 0$ the underlying graph of any letter
has a unique highest tree and that this tree is higher than all other trees by $c$
with probability $1 - O(c/\sqrt{n})$, and thus the probability for an automaton to have such
a tree for at least one letter (say $a$) is $1 - O(c^2/n)$. Note that the states of this tree
(in particular, the set $H$ of states which are higher in the highest tree than the states
of other trees by $c/2$) are random for letter $b$. We want to prove that the subset
$H$ is reachable from $F_0$ in the automaton with high probability, that is, for some state
$s \in F_0$ there is a word $u$ which maps $s$ to some state of $H$. If this subset were
random for the automaton, this would be trivial.

Using that for a random tree $T$ of height $h$ the number of states of level
$t < h$, $t = O(h)$, is linear in $t$ with high probability, we prove that there are
at least $d = 8 \ln n$ states in $H$.

Since $H$ is random for $b$, there is a set $H_b$ with at least $d/2$ states incoming
to distinct states of $H$ by letter $b$ with probability $1 - o(1/n^2)$, because the
probability of the complement event can be bounded by

K — (Jb I Z* ri, — (Jb I Zi 

where $S(n, n - k)$ is the Stirling number of the second kind. Indeed, for every
possible size of Q.b we choose a subset of at least d/2 states in H in at most 2^^ 
ways and then the probability that this subset does not lie in Q\Q.b is equal 
(d/2) /(d/2)" Then we choose Q.b by (?) ways, define partition of Q in n— fc non- 
empty subsets and finally determine a bijection between these subsets and $Q.b$ in
$(n - k)!$ ways.

Since all states of $H_b$ have different images, we can randomly add an arbitrary
number of states with the same images to get a random set for $b$ or, equivalently,
suppose that the set $H_b$ is already random for letter $b$. Thus the set $H_b$ is random
for each letter independently but can depend on their combination. This problem
can be solved as follows. Take a random letter $b$ with at least $d/4$ states $H_1$
incoming to $H$ by letter $b$, choose uniformly at random a subset $H_2$ of size $d/4$ and
redefine its outgoing edges uniformly at random to $H$. Then we get uniformly at
random an automaton $\mathcal{B}$ with at least $d/2$ states incoming to $H$ by letter $b$, that
is, an automaton like $\mathcal{A}$. The probability that a subset $H_b$ is reachable from $F_0$
in this automaton is at least the probability that a random subset of $d/4$ states
is reachable from $F_0$. The probability of the complement event can be bounded



Since $H_b$ is reachable from $F_0$ with probability $1 - o(1/n^2)$, $H$ is also reachable
from $F_0$ with probability $1 - o(1/n^2)$. Thus with probability $1 - O(1/n)$ there
is a state $s$ reachable from the F-clique $F_0$ in the automaton $\mathcal{A}$ such that its level
(in the digraph of letter $a$) is higher than the levels of states from other trees by
$c/2$. Using this and ideas from [9], in Section 3 we prove that $\mathcal{A}$ has at least $c/2$
pairwise distinct stable pairs which are random for letter $b$.

Since these pairs are random for letter b it is easy to prove that there are 
n^/^ pairwise distinct stable pairs of states random for letter a with probability 
$1 - O(1/n)$.

Using that the number of cycles of a random digraph is strongly concentrated
near $(\ln n)/2$, we get that $a$ has at most $\ln n$ cycles with probability
$1 - O(1/n)$.
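The statement about the number of cycles can be checked empirically with a short Python sketch. This is only a sanity-check experiment under stated assumptions: the names `count_cycles` and `mean_cycles` are hypothetical, a random digraph is sampled as a uniformly random map on $\{0, \ldots, n-1\}$, and the sketch makes no claim about the exact constants.

```python
import random


def count_cycles(f):
    """Count the cycles of the functional graph q -> f[q]."""
    n = len(f)
    state = [0] * n  # 0 = unvisited, 1 = on the current walk, 2 = finished
    cycles = 0
    for start in range(n):
        if state[start]:
            continue
        path = []
        v = start
        while state[v] == 0:
            state[v] = 1
            path.append(v)
            v = f[v]
        if state[v] == 1:
            # The walk ran into itself, so a new cycle has been closed.
            cycles += 1
        for u in path:
            state[u] = 2
    return cycles


def mean_cycles(n, trials, seed=0):
    """Average number of cycles over `trials` uniformly random maps."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        f = [rng.randrange(n) for _ in range(n)]
        total += count_cycles(f)
    return total / trials
```

Comparing `mean_cycles(n, trials)` with `math.log(n) / 2` for growing `n` gives a feel for the logarithmic growth used in the argument.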

The cycle vertices of $a$ induce the state partition as follows. Two states $p, q$
are in the same class if and only if for some positive integer $d > 0$ we have
$p.b^d = q.b^d$. Thus for any cluster $C_i$ with a cycle of length $l$ all states in $C_i$ are
partitioned into $l = O(\sqrt{n})$ classes. Then with high probability for each pair of
(probably equal) big clusters Ci, C2 such that |Ci|, IC2I > v}^^ there are a lot of 
stable pairs of states $p_1, p_2$ such that $p_i$ lies in the cluster $C_i$ (for $i \in \{1, 2\}$)
and $p_1$ and $p_2$ are from different classes. Using that there are a lot of such pairs for
the case when the clusters coincide, we get that all pairs from the same cluster are
stable, and using that there are such pairs for each pair of big clusters, we get that
all pairs from different clusters are also stable with probability $1 - O(1/n)$. This
immediately implies that all pairs from big clusters are synchronizing with
probability $1 - O(1/n)$ because they can be mapped to some cycle pair. Since
there can be only few small clusters, we get that there are at least $n - o(n)$ pairwise
synchronizing states with probability $1 - O(1/n)$. This contradicts the fact
that $F_0$ reaches $n/(4e^2)$ different states with probability $1 - O(1/n)$.

2 A probability to have a unique highest tree 

Theorem 1. For $c = o(\sqrt{n})$ the probability that a random digraph from $\Sigma_n$ has
a unique highest tree and that this tree is higher than all others by $c$ is equal
to $1 - O(c/\sqrt{n})$.

We prove that the probability of the negation event tends to 0, i.e. we prove that
the probability that a random digraph $g$ has at least two trees of height greater
than the height of $g$ minus $c$ tends to 0 as $O(c/\sqrt{n})$. We almost repeat the proof
of the limit distribution for the height of a random digraph from $\Sigma_n$ from [3].
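To make the notion of a unique highest tree concrete, here is a minimal Python sketch. It is an illustration under assumptions that are not fixed by the text: the names `tree_heights` and `has_unique_highest_tree` are hypothetical, a tree is taken to be the set of states whose first cycle vertex is a given root, its height is the largest distance to that root, and "higher by $c$" is read as a height difference of at least $c$.

```python
def tree_heights(f):
    """Map each cycle vertex (root) of q -> f[q] to the height of its tree."""
    n = len(f)
    state = [0] * n  # 0 = unvisited, 1 = on the current walk, 2 = finished
    on_cycle = [False] * n
    for start in range(n):
        if state[start]:
            continue
        path = []
        v = start
        while state[v] == 0:
            state[v] = 1
            path.append(v)
            v = f[v]
        if state[v] == 1:
            # The cycle is the suffix of the walk starting at v.
            for u in path[path.index(v):]:
                on_cycle[u] = True
        for u in path:
            state[u] = 2

    level = [0 if on_cycle[v] else None for v in range(n)]
    root = [v if on_cycle[v] else None for v in range(n)]
    for start in range(n):
        path = []
        v = start
        while level[v] is None:
            path.append(v)
            v = f[v]
        # Fill in levels from the vertex closest to the cycle outwards.
        for u in reversed(path):
            level[u] = level[f[u]] + 1
            root[u] = root[f[u]]

    heights = {}
    for v in range(n):
        heights[root[v]] = max(heights.get(root[v], 0), level[v])
    return heights


def has_unique_highest_tree(f, c):
    """True if one tree is higher than every other tree by at least c."""
    hs = sorted(tree_heights(f).values(), reverse=True)
    return len(hs) == 1 or hs[0] - hs[1] >= c
```

Running `has_unique_highest_tree` on many uniformly random maps for a fixed small `c` gives an empirical counterpart of the probability discussed in Theorem 1.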

We need the following additional notations from [6]:

- $F_{n,N}$ denotes the probability space on the set of all (ordered) forests with
$n + N$ states and $N$ root vertices and the uniform probability distribution.

- For $x \in F_{n,N}$ or $x \in \Sigma_n$ denote by $\tau(x)$ the height of $x$.

- $G^N$ denotes the probability space of random critical branching processes; for
$\mu \in G^N$ and $t \ge 0$ by $\mu(t)$ we denote the number of elements in this process
at moment $t$, $\mu(0) = N$, and by $\nu(\mu)$ we denote the number of elements in $\mu$
after the degeneration moment $\tau(\mu)$ of the process $\mu$.



- Given a $\mu \in G^N$ and $i \in \{1, 2, \ldots, N\}$ denote by $\nu^i$ the number of elements
of the $i$-th tree from $\mu$, by $\nu_t^i$ the value of $\nu^i$ under the condition $\tau(\nu^i) < t$, i.e.
$P(\nu_t^i = k) = P(\nu^i = k \mid \tau(\nu^i) < t)$, and correspondingly by $\bar{\nu}_t^i$ the value of $\nu^i$
under the condition $\tau(\nu^i) = t$.

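Theorem 2 (Kolchin [6]). For any $x > 0$

$$\lim_{n \to \infty} P_{g \in \Sigma_n}\left(\tau(g)/\sqrt{n} < x\right) = \sum_{k=-\infty}^{+\infty} (-1)^k e^{-k^2 x^2/2}.$$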

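It follows from this theorem that for a random digraph $g \in \Sigma_n$ the value of $\tau(g)$
has order $\sqrt{n}$, namely

$$P_{g \in \Sigma_n}\left(\tau(g)/\sqrt{n} \in (r_1, r_2)\right) \ge 1 - \delta(r_1, r_2) \qquad (1)$$

where $\delta(r_1, r_2)$ is arbitrarily small for $r_1$, $1/r_2$ small enough.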

Actually, the same is true for the number of root vertices $\lambda(g)$:

$$P_{g \in \Sigma_n}\left(\lambda(g)/\sqrt{n} \in (r_1, r_2)\right) = 1 - \delta(r_1, r_2) \qquad (2)$$

where $\delta(r_1, r_2)$ is arbitrarily small for $r_1$, $1/r_2$ small enough.

It is clear that any digraph $g \in \Sigma_n$ with $\lambda(g) = N$ corresponds to the unique
forest $f(g) \in F_{n-N,N}$ of its trees. Hence

$$P_{g \in \Sigma_n}(\tau(g) = k \mid \lambda(g) = N) = P_{f \in F_{n-N,N}}(\tau(f) = k).$$

This implies that the height distribution for $\Sigma_n$ can be obtained by averaging over $\lambda$ the
height distribution on $F_{n-N,N}$. In its turn, the height distribution on $F_{n-N,N}$
can be considered as the distribution of the degeneration moment on $G^N$ under
the condition that there are $n$ elements after degeneration, namely

$$P_{f \in F_{n-N,N}}(\tau(f) = k) = P_{\mu \in G^N}(\tau(\mu) = k + 1 \mid \nu(\mu) = n).$$



Slightly overloading notation, denote by $f(\mu)$ the forest we get after the
degeneration of $\mu$. Denote by $A_{N,t} \subseteq G^N$ the subset of $\mu \in G^N$ such that $f(\mu)$ has at
least one tree of height $t$, at least one tree of height between $t$ and $t + c$, and all
other trees are lower than $t + 1$.

We now briefly describe the changes we have to make in the proof from [6]. For
a moment we consider only random branching processes $\mu \in G^N$ and thus will
omit it as an index of $P$.

Lemma 1 (Kolchin [6]). If $P(\nu(\mu) = n + N) > 0$ then for any $t > 0$

$$P(\tau(\mu) < t \mid \nu(\mu) = n + N) = \left(P(\tau(\nu^1) < t)\right)^N \frac{P(\nu_{N,t} = n + N)}{P(\nu_N = n + N)},$$

where $\nu_{N,t} = \sum_{i=1}^{N} \nu_t^i$ and $\nu_N = \sum_{i=1}^{N} \nu^i$.



If we replace $P(\tau(\mu) < t \mid \nu(\mu) = n + N)$ in the above lemma by
$P(A_{N,t} \mid \nu(\mu) = n + N)$ we get



P(Ajv,t I iy{tJ.) = n+N) = Oil)cN{N-l)PiT{iy'^-') = t)P{T{u'^) e [t,t+c])* 



Pirn ^n + N) 

Indeed, we choose a tree of height $t$ in $N$ ways, then choose a tree with height
between $t$ and $t + c$ in $(N - 1)$ ways and replace it with $\bar{\nu}^i$. Since all one-element
processes of $\mu$ are independent and identically distributed, we can choose the numbers
$N - 1, N$ and then multiply by $N(N - 1)$.

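Using that in the theorem $N \to \infty$ and $P(\tau(\nu^1) = t) = 2/t^2 \, (1 + o(1))$ we can
simplify this formula as

$$P(A_{N,t} \mid \nu(\mu) = n + N) = O(1) \, c \, \frac{N^2}{t^4} \left(P(\tau(\nu^1) < t)\right)^N \frac{P(\nu_{N,t} = n + N)}{P(\nu_N = n + N)}.$$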

We have erased $-\bar{\nu}^{N-1} - \bar{\nu}^{N}$ because the probability that $\bar{\nu}^i > \sqrt{N}$ also tends
to 0 as $N \to \infty$. Thus we have got the same formula on the right with an
additional factor $O(1) c N^2/t^4$.

Given $x > 0$ let $B_{n,N,x} \subseteq F_{n,N}$ be the set of all forests $f$ of height at most
$x\sqrt{n}$ such that there are at least two trees of height at least $\tau(f) - c$. Then replacing
the event $\tau(f) \le x\sqrt{n}$ with $B_{n,N,x}$ we get the following analog of Theorem 3 of
Kolchin [6].

Theorem 3. For any $x > 0$ and $n, N \to \infty$ such that $0 < z_1 < z = N/\sqrt{n} < z_2 < \infty$
we get

PfeF^,{B^.N,,)=0{l){czyx^n)—= / e-'''-^^(^''')/^d,.(l + o(l)). 

zv27Tj-oo 

Finally, denote by $B_{n,x}$ the set of all digraphs $g$ in $\Sigma_n$ such that $f(g) \in B_{n,\lambda(g),x}$. Then

1 /•+00 (- + 00 

PgeE„{Bn,x) = ■■■ = {ceil)/x''n)^= / / z2g-».-./(x,.)/.^^^^ 

V27T j-oo Jo 

that is, we have an additional factor $O(1) c/(x^4 n)$ and an additional factor $z^2$ in
the integral. Using that

-e-"(a2_2a + 2)|° - ^^ ^ 



fix, v) Jo 



f{x,v) °° f{x,v) 



we get the same expression as a result with the factor ^j^ and thus eventually 
we obtain that 



e{i)c 



+00 

22 , 



p...„(i?„,.) = ^ E(-i^-'^ 






Since $\sum_{k=-\infty}^{+\infty} (-1)^k e^{-k^2 x^2/2}$ is the limit of the probability
$P_{g \in \Sigma_n}(\tau(g)/\sqrt{n} < x)$, which is a positive constant, we get

Denote by $B_n \subseteq \Sigma_n$ the set of all digraphs $g$ such that $f(g) \in B_{n,\lambda(g),x}$ for some
$x$. Replacing $t$ with $x\sqrt{n}$ we get

[v^/d] 
■Pge2;„(Sn) = lini 2_, -fsei:„(-B„,A(5),t/\Ar) ^ 

t—dy'n ^ 

Thus the theorem is proved. □

Since the letters for random automata are chosen independently the following 
corollary is straightforward. 

Corollary 1. For any integer constant $c = o(\sqrt{n})$ the probability that a random
$t$-letter automaton with $n$ states has a unique highest tree for at least one
letter and that this tree is higher than all other trees of this letter by $c$ is



3 Searching for stable pairs 

We need the following definitions from [9]. A pair of states $\{p, q\}$ is called deadlock
if there is no word $s$ such that $p.s = q.s$. A subset $A \subseteq Q$ is called an F-clique
of $\mathcal{A}$ if it is a set of maximum size such that each pair of states from $A$ is
deadlock. It follows from the definition that all F-cliques have the same size. Let
us reformulate Lemma 2 in the following way.

Lemma 2 (Trahtman [9]). Let $A$ and $B$ be two distinct F-cliques such that
$A \setminus B = \{p\}$, $B \setminus A = \{q\}$ for some pair of states $p, q$. Then $\{p, q\}$ is a stable pair.

Proof. Arguing by contradiction, suppose there is a word $s$ such that $\{p.s, q.s\}$
is deadlock. Then $(A \cup B).s$ is an F-clique because all its pairs are deadlock and
$|(A \cup B).s| = |A| + 1 > |A|$ because $p.s \ne q.s$. This contradicts the maximality of
$A$.



Notice that the condition that the underlying graph of the automaton is an
AGW graph is not used in Theorem 2 from [9]; this property is only used to
prove that the underlying graph of a letter has a unique highest tree. We only
need in Theorem 2 from [9] that states from the highest tree are reachable from
some F-clique.

Theorem 4. The probability that a random 2-letter automaton have k — n^'^ 
stable pairs is $1 - O(1/n)$.

Proof. By Corollary 1 we get that there is a letter (say $a$) in the automaton
$\mathcal{A}$ with highest tree $T$ which is higher than all other trees by a constant $c$ with
probability $1 - O(1/n)$. By Lemma 2 there is a stable pair in this case and we
can consider the underlying graph of letter $a$ in the factor automaton by the
stability relation. Since the relation is stable, the heights of the corresponding trees
in the factor automaton $\mathcal{B}$ cannot increase, and if the height of $T$ in the factor
automaton is decreased by $r$ then there are at least $r$ stable pairs (connecting
states of different levels) in $\mathcal{A}$. Thus there are at least $r$ stable pairs and if $r < c$
then the highest tree in $\mathcal{B}$ is higher than all other trees by $c - r$. Hence by
induction we get that $\mathcal{A}$ has at least $c - r$ stable pairs.

Since these c stable pairs are random for b each pair p, q is merged and does 
not intersect with other pairs by b with probability 1 — c/n in one step, the 
probability that p, q is not merged and does not intersect with other pairs for 
n^/^ steps is 1 — l/n^^^. Hence for c = 9 the probability that there is such a pair 
is $1 - O(1/n)$. Thus the theorem is proved. □

4 Reachable and strongly connected automata 

Theorem 5. The probability that a 2-letter random automaton $\mathcal{A}$ with $n$ states
is not reachable is at least of order $1/n$, that is, $\Omega(1/n)$.

Proof. Let $\mathcal{A}$ have exactly one disconnected state, that is, a singleton weakly
connected component. These automata can be counted as follows. We first choose
a disconnected state in $n$ ways (the transitions for this state are defined in 1 way)
and then the number of different transitions for any other state is

$$1 \cdot (n - 2) + (n - 2)(n - 1) = n(n - 2)$$

because the other states are not disconnected. Thus the probability of such automata
is equal to

$$\frac{n \, (n(n - 2))^{n-1}}{n^{2n}} = \frac{1}{n} \left(1 - \frac{2}{n}\right)^{n-1} = \Theta(1/n).$$

Thus the theorem is proved. □
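The count in this proof can be double-checked for very small $n$ with a brute-force Python sketch. The reading of the count used here is an assumption made for the illustration: one state $s$ has both transitions equal to itself and no incoming transitions from other states, and no other state has both transitions equal to itself. The function names are hypothetical.

```python
from itertools import product


def formula_count(n):
    """The count n * (n(n-2))^(n-1) from the proof."""
    return n * (n * (n - 2)) ** (n - 1)


def brute_force_count(n):
    """Enumerate all n^(2n) two-letter automata; feasible only for tiny n."""
    states = range(n)
    count = 0
    for a in product(states, repeat=n):
        for b in product(states, repeat=n):
            # States whose two transitions are both self-loops.
            loops = [q for q in states if a[q] == q and b[q] == q]
            if len(loops) != 1:
                continue
            s = loops[0]
            # The isolated state must not be entered from any other state.
            if all(a[q] != s and b[q] != s for q in states if q != s):
                count += 1
    return count


def probability(n):
    """Closed form (1/n) * (1 - 2/n)^(n-1) of the same probability."""
    return (1 - 2 / n) ** (n - 1) / n
```

Since $(1 - 2/n)^{n-1}$ tends to $e^{-2}$, the product `n * probability(n)` approaches `math.exp(-2)`, which is the $\Theta(1/n)$ behaviour claimed above.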



Since synchronizing automata are necessarily reachable, we get the main
result.

Theorem 6. The probability that a 2-letter random automaton with n states is 
synchronizing tends to 1 as $1 - \Theta(1/n)$.